NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans

Lal, Yash Kumar; Cohen, Vanya; Chambers, Nathanael; Balasubramanian, Niranjan; Mooney, Ray (November 2024, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
Handling Open-Vocabulary Constructs in Formalizing Speci- fications: Retrieval-Augmented Parsing with Expert Knowl- edge

Hasan, Mohammad Saqib; Ghosh, Sayontan; Verma, Dhruv; Kuenning, Geoff; Zadok, Erez; Smolka, Scott; Balasubramanian, Niranjan (October 2024, https://openreview.net/forum?id=BgvgMxY8s5)

We study the problem of Open-Vocabulary Constructs (OVCs)—ones not known beforehand—in the context of converting natural language (NL) specifications into formal languages (e.g., temporal logic or code). Mod- els fare poorly on OVCs due to a lack of necessary knowledge a priori. In such situations, a domain expert can provide correct constructs at in- ference time based on their preferences or domain knowledge. Our goal is to effectively reuse this inference-time, expert-provided knowledge for future parses without retraining the model. We present dynamic knowledge- augmented parsing (DKAP), where in addition to the input sentence, the model receives (dynamically growing) expert knowledge as a key-value lexicon that associates NL phrases with correct OVC constructs. We pro- pose ROLEX, a retrieval-augmented parsing approach that uses this lexicon. A retriever and a generator are trained to find and use the key-value store to produce the correct parse. A key challenge lies in curating data for this retrieval-augmented parser. We utilize synthetic data generation and the data augmentation techniques on annotated (NL sentence, FL statement) pairs to train the augmented parser. To improve training effectiveness, we propose multiple strategies to teach models to focus on the relevant subset of retrieved knowledge. Finally, we introduce a new evaluation paradigm modeled after the DKAP problem and simulate the scenario across three formalization tasks (NL2LTL, NL2Code, and NL2CMD). Our evaluations show that DKAP is a difficult challenge, and ROLEX helps improve the performance of baseline models by using dynamic expert knowledge effectively.
more » « less
Full Text Available
Handling Open-Vocabulary Constructs in Formalizing Specifications: Retrieval Augmented Parsing with Expert Knowledge

Hasan, Mohammad_Saqib; Ghosh, Sayontan; Verma, Dhruv; Kuenning, Geoff; Zadok, Erez; Smolka, Scott; Balasubramanian, Niranjan (October 2024, Conference on Language Modeling (COLM))

Full Text Available
Look Hear: Gaze Prediction for Speech-Directed Human Attention

https://doi.org/10.1007/978-3-031-72946-1_14

Mondal, Sounak; Ahn, Seoyoung; Yang, Zhibo; Balasubramanian, Niranjan; Samaras, Dimitris; Zelinsky, Gregory; Hoai, Minh (October 2024, Lecture notes in computer science)

Full Text Available
The Times They Are A-Changin': Characterizing Post-Publication Changes to Online News

Tsoukaladelis, Chris; Kondracki, Brian; Balasubramanian, Niranjan; Nikiforakis, Nick (May 2024, IEEE Symposium on Security and Privacy)

Full Text Available
PASTA: A Dataset for Modeling Participant States in Narratives

Ghosh, Sayontan; Koupaee, Mahnaz; Chen, Isabella; Ferraro, Francis; Chambers, Nathanael; Balasubramanian, Niranjan (March 2024, Transactions of the Association for Computational Linguistics)

The events in a narrative are understood as a coherent whole via the underlying states of their participants. Often, these participant states are not explicitly mentioned, instead left to be inferred by the reader. A model that understands narratives should likewise infer these implicit states, and even reason about the impact of changes to these states on the narrative. To facilitate this goal, we introduce a new crowdsourced English-language, Participant States dataset, PASTA. This dataset contains inferable participant states; a counterfactual perturbation to each state; and the changes to the story that would be necessary if the counterfactual were true. We introduce three state-based reasoning tasks that test for the ability to infer when a state is entailed by a story, to revise a story conditioned on a counterfactual state, and to explain the most likely state change given a revised story. Experiments show that today’s LLMs can reason about states to some degree, but there is large room for improvement, especially in problems requiring access and ability to reason with diverse types of knowledge (e.g. physical, numerical, factual).
more » « less
Full Text Available
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

https://doi.org/10.18653/v1/2024.acl-long.850

Trivedi, Harsh; Khot, Tushar; Hartmann, Mareike; Manku, Ruskin; Dong, Vinty; Li, Edward; Gupta, Shashank; Sabharwal, Ashish; Balasubramanian, Niranjan (January 2024, Association for Computational Linguistics)

Full Text Available
Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action Anticipation

Ghosh, Sayontan; Aggarwal, Tanvi; Hoai, Minh; Balasubramanian, Niranjan (April 2023, Findings of the Association for Computational Linguistics: EACL 2023)

Full Text Available
NEUROSTRUCTURAL DECODING: Neural Text Generation with Structural Constraints

https://doi.org/10.18653/v1/2023.acl-long.528

Bastan, Mohaddeseh; Surdeanu, Mihai; Balasubramanian, Niranjan (January 2023, 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Full Text Available
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions

https://doi.org/10.18653/v1/2023.acl-long.557

Trivedi, Harsh; Balasubramanian, Niranjan; Khot, Tushar; Sabharwal, Ashish (January 2023, 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Full Text Available

« Prev Next »

Search for: All records